o 

Oh! 

(N 
(N 



Adaptive estimator of the memory parameter and 
goodness-of-fit test using a multidimensional increment 

ratio statistic 

Jean-Marc Bardet and Bechir Dola 

bardet@univ-parisl.fr , bechir.dola@malix.univ-parisl.fr 

SAMM, Universite Pantheon-Sorbonne (Paris I), 90 rue de Tolbiac, 75013 Paris, FRANCE 

September 23, 2011 

Abstract 

The increment ratio (IR) statistic was first defined and studied in Surgailis et al. (2007) for estimating 



jy^ , the memory parameter either of a stationary or an increment stationary Gaussian process. Here three 

extensions are proposed in the case of stationary processes. Firstly, a multidimensional central limit 
theorem is established for a vector composed by several IR statistics. Secondly, a goodness-of-fit x 2 -tyP e 
test can be deduced from this theorem. Finally, this theorem allows to construct adaptive versions of 
the estimator and test which are studied in a general semiparametric frame. The adaptive estimator of 
the long-memory parameter is proved to follow an oracle property. Simulations attest of the interesting 
accuracies and robustness of the estimator and test, even in the non Gaussian case. 
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1 Introduction 

After almost thirty years of intensive and numerous studies, the long-memory processes form now an im- 
portant topic of the time series study (see for instance the book edited by Doukhan et al, 2003). The most 
famous long-memory stationary time series are the fractional Gaussian noises (fGn) with Hurst parameter H 
and FARIMA(p, d, q) processes. For both these time series, the spectral density / in follows a power law: 
/(A) ~ C A~ 2d where H = d + 1/2 in the case of the fGn. In the case of long memory process d £ (0, 1/2) 
but a natural expansion to d e (—1/2, 0] (short memory) implied that d can be considered more generally as 
a memory parameter. 

There are a lot of statistical results relative to the estimation of this memory parameter d. First and main 
results in this direction have been obtained for parametric models with the essential articles of Fox and Taqqu 
(1986) and Dahlhaus (1989) for Gaussian time series, Giraitis and Surgailis (1990) for linear processes and 
Giraitis and Taqqu (1999) for non linear functions of Gaussian processes. 

However parametric estimators are not really robust and can induce no consistent estimations. Thus, the 
research is now rather focused on semiparametric estimators of the memory parameter. Different approaches 
were considered: the famous R/S statistic (see Hurst, 1951), the log-periodogram estimator (studied firstly by 
Geweke and Porter-Hudack, 1983, notably improved by Robinson, 1995a, and Moulines and Soulier, 2003), 
the local Whittle estimator (see Robinson, 1995b) or the wavelet based estimator (see Veitch et al, 2003, 
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Moulines et al, 2007 or Bardet et al, 2008). All these estimators require the choice of an auxiliary parameter 
(frequency bandwidth, scales, etc.) but adaptive versions of these estimators are generally built for avoiding 
this choice. In a general semiparametric frame, Giraitis et al (1997) obtained the asymptotic lower bound 
for the minimax risk in the estimation of d, expressed as a function of the second order parameter of the 
spectral density expansion around 0. Several adaptive semiparametric estimators are proved to follow an 
oracle property up to multiplicative logarithm term. But simulations (see for instance Bardet et al, 2003 or 
2008) show that the most accurate estimators are local Whittle, global log-periodogram and wavelet based 
estimators. 



In this paper, we consider the IR (Increment Ratio) estimator of long-memory parameter (sec its defini- 
tion in the next Section) for Gaussian time series recently introduced in Surgailis et al. (2007) and we propose 
three extensions. Firstly, a multivariate central limit theorem is established for a vector of IR statistics with 
different "windows" (see Section [2]) and this induces to consider a pseudo-generalized least square estimator 
of the parameter d. Secondly, this multivariate result allows us to dehne an adaptive estimator of the memory 
parameter d based on IR statistics: an "optimal" window is automatically computed (see Section |3|). This 
notably improves the results of Surgailis et al. (2007) in which the choice of m is either theoretical (and cannot 
be applied to data) or guided by empirical rules without justifications. Thirdly, an adaptive goodness-of-fit 
test is deduced and its convergence to a chi-square distribution is established (see Section [3|) . 
In Section|4j several Monte Carlo simulations are realized for optimizing the adaptive estimator and exhibiting 
the theoretical results. Then some numerical comparisons are made with the 3 semiparametric estimators 
previously mentioned (local Whittle, global log-periodogram and wavelet based estimators) and the results 
are even better than the theory seems to indicate: as well in terms of convergence rate than in terms of 
robustness (notably in case of trend or seasonal component), the adaptive IR estimator and goodness-of-fit 
test provide efficient results. Finally, all the proofs are grouped in Section [5l 



2 The multidimensional increment ratio statistic and its statistical 
applications 

Let X = (Xk)keN be a Gaussian time series satisfying the following Assumption S(d,/3): 

Assumption S(d,/3): There exist e > 0, cq > 0, c' Q > and c\ £ M such that X = (X t )tei is a sta- 
tionary Gaussian time series having a spectral density f satisfying for all A € (— 7T, 0) U (0, it) 

/(A) =c Q \X\- 2d + c 1 \X\- 2d+fi + 0(\X\- 2d+ ^) and |/'(A)| < c£, \~ 2d ~ l . (2.1) 

Remark 1. Note that here we only consider the case of stationary processes. However, as it was already done 
in Surgailis et al. (2007), it could be possible, mutatis mutandis, to extend our results to the case of processes 
having stationary increments. 

Let (Ai, • • • , Ajy) be a path of X. For m <G N*, define the random variable 77?jv(m) such as 

jt> I \ 1 \\2^it=k+l ^t+m — l^t=k+l At -> + \l^t=k+m+l ~ l^t=k+m+l A ' t )\ 

N — 3m ' |/v^ fe + m V _ v^fc+wi y \l , i/v^fe+2m y _ Y^fc+ 2m Y \ \ ' 

k=0 I V2^t=fc+1 A t+m 2__,t=k+\ ■ A -t)\ + \\lut=k+m+l ^t+™ 2jt=fc+ ro +l A t) I 

From Surgailis et al. (2007), with m such that N/m —> oo and m — > oo, 



— (lR N (m)-EIR N (m)) Af(0,a 2 (d)), 
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where 



<r%d) := 2/ Cov(^#±^,^±^±4)dr (2.2) 
' \\Z d {0)\ + \Z d (l)\ \Z d {r)\+\Z d (r + l)\) V ' 



and Z d (r) := -== (% ,(r + 2) - 2 %„, 5 (r + 1) + %o. 5 (r)) (2.3) 
^/|4 d +°- 5 -4| 

with a standardized fractional Brownian motion (FBM) with Hurst parameter H G (0, 1). 

Remark 2. This convergence was obtained for Gaussian processes in Surgailis et al. (2007), but there also 
exist results concerning a modified IR statistic applied to stable processes (see Vaiciulis, 2009) with a different 
kind of limit theorem. We may suspect that it is also possible to extend the previous central limit theorem to 
long memory linear processes ( since a Donsker type theorem with FBM as limit was proved for long memory 
linear processes, see for instance Ho and Using, 1997) but such result requires to prove a non obvious central 
limit theorem for afunctional of multidimensional linear process. Surgailis et al. (2007) also considered the 
case of i.i.d.r.v. in the domain of attraction of a stable law with index < a < 2 and skewness parameter 
— 1 < f3 < 1 and concluded that IRn(jti) converges to almost the same limit. Finally, in Bardet and Surgailis 
(2011) a "continuous" version of the IR statistic is considered for several kind of continuous time processes 
(Gaussian processes, diffusions and Levy processes). 

Now, instead of this univariate IR statistic, define a multivariate IR statistic as follows: let rrij = j m, j = 1 , ■ ■ ■ , p 
with 2 < p[N/m] — 4, and define the random vector (IRnU rn))i<j< p . Thus, p is the number of considered 
window lengths of this multivariate statistic. In the sequel we naturally extend the results obtained for 
m G N* to m G (0, oo) by the convention: (IRn{j , m))i<j<p = (IRnU [ m ]))i<i<p (which change nothing to 
the asymptotic results). 

We can establish a multidimensional central limit theorem satisfied by (IRj^(j m))i<j< p : 
Property 2.1. Assume that Assumption S(d,fj) holds with —0.5 < d < 0.5 and /3 > 0. Then 



^ (lR N (j m) - E [IR N (j m)] ) 
with T p (d) — (o~ij(d))i<i.j< p where for t G M 

ai,j(d) := / Cov( 



JV(o,r p (d)) (2.4) 

<j<p [N/m\f\m— ¥oo 



\Z?{G) + Z?(i)\ \Z^(r) + Z^(r + j)\ 



dr 



\zf{Q)\+\zf{i)\' \Z^\r)\ + \Z^\r + ] )\ 
and Zf{r) = ^== _ ^ (B d+0 . 5 (r + 2j) - 2B d+0 . 5 (r + j) + B d+0 . 5 ( T )). (2.5) 

The proof of this property as well as all the other proofs are given in Appendix. Moreover we will assume in 
the sequel that T p (d) is a definite positive matrix for all d G (—0.5,0.5). 



Remark 3. Note that Assumption S(d,f3) are a little stronger than the conditions required in Surgailis et 
al. (2007) where f is supposed to satisfy f(X) = c Q \X\- 2d + 0{\\\~ 2d+fi ) and |/'(A)| < c^A- 2 ^ 1 . Note 
that Property \2.1\ and following Theorem [7] and Proposition [7] are as well checked under these assumptions of 
Surgailis et al. (2007) even if f3 > 2d + 1 (case which is not consider in their Theorem 2.4)- However our 
automatic procedure for choosing an adaptive scale ffiN requires to specify the second order of the expansion 
of f and we prefer to already give results under such assumption. 

As in Surgailis et al. (2007), for r G (—1, 1), define the function A(r) by 

2 ll + r 1 /l +r , , 2 , , , 

A r := - arctanW- + - J- log . 2.6 
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and for d G (-0.5, 1.5) let 

A (d) := A(p(d)) where p(d) := £^lz^lf^ , (2.7) 

The function d e (—0.5, 1.5) — > Ao(d) is a C°° increasing function. Now, Property 15.11 (see in Section 
O provides the asymptotic behavior of E[IR(m)] when m — > oo, which is ~E[IR(m)] ~ Ao(d) + Cm~" if 
/3 < 2d + 1, E[/i?(m)] - A (d) + Cm^logm if /3 = 2d + 1 and E[IR(m)} ~ A (d) + 0{mr^ d+ ^) if 
/9 > 2d+ 1 (C is a non vanishing real number depending on d and 0). Therefore by choosing m and N such 
as (-y/ iV ~/m)m~P — > 0, (yJV/m)m"^ log to — > and (y^V/m)rn _ ^ 2/3+1 ) — > (respectively) when to, N — > oo, 
the term E [77?( jm)] can be replaced by Ao(d) in Property 12. II Then, using the Delta-method with function 
(xi)i<i< P i-> (A(7 1 (.T i ))i< i <p, we obtain: 

Theorem 1. Lei djv(jm) := Aq 1 (IRn(j ; to)) for 1 < j ' < p. Assume that Assumption S(d,/3) holds with 
-0.5 < d < 0.5 and /3 > 0. TTiera if m ~ C N a with C > and (1 + 2/3)" 1 V (4d + 3)" 1 < a < 1 fen 



-(£v(jm)-d) A A^(o,(A^(d))- 2 r p (d)). 

TO V / l<j<p W^oo V 



(2.8 



Remark 4. If j3 < 2d+ 1, £/ie estimator d]y(m) is a semiparametric estimator of d and its asymptotic mean 
square error can be minimized with an appropriate sequence (rnjv) reaching the well-known minimax rate of 
convergence for memory parameter d in this semiparametric setting (see for instance Giraitis et al., 1997). 
Indeed, under Assumption S(d,f3) with d € (—0.5,0.5) and j3 > and if tun — [N 1 ^ 1 ^^], then the estimator 
dAr(mAr) is rate optimal in the minimax sense, i.e. 

limsup sup sup ■ E[(djv(wi/v) — d) ] < oo. 

N^oo de(-Q.5,Q.S) f£S(d,P) 

From the multidimensional CLT ()2.8[) a pseudo-generalized least square estimation (LSE) of d is possible by 
defining the following matrix: 

E N (m) := (A^djv(m))- 2 T p (d N (m)). (2.9) 

Since the function d € (—0.5,1.5) i-> er(d)/A'(d) is C°° it is obvious that under assumptions of Theorem Q] 
then 

t N (m) A (A' (d))- 2 T p (d). 

Then with the vector J p := (l)i<j<p and denoting J p its transpose, the pseudo-generalized LSE of d is: 

djv(m) := (Jp(Sjv(m)) J p ) (Ejv(m)) 1 (d N (m i )) 1 ^ p 

It is well known (Gauss-Markov Theorem) that the Mean Square Error (MSE) of djv(m) is smaller or equal 
than all the MSEs of d^{jm), j — 1, . . . ,p. Hence, we obtain under the assumptions of Theorem 12. 81 

N {d N (m)-d) A Af(0, A' (dr 2 (JpTpHd)^)- 1 ), (2.10) 

and A^d)- 2 ^- 1 ^)- 1 < A^(d)- 2 a 2 (d). 

Now, consider the following test problem: for (Xi, ■ ■ ■ ,X n ) a path of X a Gaussian time series, chose between 

• Hq: the spectral density of X satisfies Assumption S(d, f3) with —0.5 < d < 0.5 and j3 > 0; 

• Hi: the spectral density of X does not satisfy such a behavior. 

We deduce from the multidimensional CLT (|2.8|) a x 2 -type goodness-of-fit test statistic defined by: 



fjy(m) := ^ (djv(m) - d N (j m))[^ p (E N (m)) 1 (d A r(m) - d N (j m)) 
Then the following limit theorem can be deduced from Theorem [1] 
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Proposition 1. Under the assumptions of Theorem]]] then: 



T N (m) X>-1). 

jV-Kx> 

3 Adaptive versions of the estimator and goodness-of-fit test 

Theorem Q] and Proposition [1] are interesting but they require the knowledge of (3 to be used (and therefore 
an appropriated choice of m). We suggest now a procedure (see also Bardet et ai, 2008) for obtaining a 
data-driven selection of an optimal sequence (rnjv)- For d G (—0.5, 1.5) and a € (0, 1), define 

Q N (a,d) := (d N (jN a )-d)[^ p (E N (N a ))-\d N (jN a )-d) 1 ^ p . (3.1) 

Note that by the previous convention, d N (J N a ) = d N (j[N a }) and d N (N a ) = d N ([N a ]). Thus Q N (a,d) 
corresponds to the sum of the pseudo-generalized squared distance. From previous computations, it is obvious 
that for a fixed a G (0, 1), Q is minimized by dN(N a ) and therefore for < a < 1 define 

Q N {a) :=Q N (a,d N (N a )). 

It remains to minimize QAr(a) on (0, 1). However, since Sat has to be obtained from numerical computations, 
the interval (0, 1) can be discretized as follows, 

Hence, if a € _4at, it exists /eg {2, 3, ... , log[iV/p]} such that k = a log Af. Consequently, define Sat by 

Qn(Sn) ■= min Qat(«). 
From the central limit theorem (|2.8|) one deduces the following : 

Proposition 2. Assume that Assumption S(d,f3) holds with —0.5 < d < 0.5 arid /3 > 0. Moreover, if 
(3 > 2d + 1, suppose that CQ,ci,C2,d,j3 and e are such that Condition h5.13]) or {5.1$ holds. Then, 

v 1 

un — > a 



n^oo (1 + 2/3) A {Ad + 3) ' 

Remark 5. The choice of the set of discretization An is implied by our proof of convergence ofctN to a* . If 
the interval (0, 1) is stepped in N c points, with c > 0, the used proof cannot attest this convergence. However 
logiV may be replaced in the previous expression of An by any negligible function of N compared to functions 
N c with c > (for instance, (\ogN) a or alogN can be used). 

Remark 6. The reference to Condition H5.13]) or \5. 14]) is necessary because our proof of the convergence 
of oin to a* requires to know the exact convergence rate of E[/i?jv(A^ Q )] — A (d) when a < a*. When 
(3 < 2d + 1, since we replaced the conditions on the spectral density of Surgailis et al. (2007) by a second 
order condition (Assumption S{d,f3)), this convergence rate can be obtained by computations (see Property 
15. 1]) . But if (3 > 2d + 1, we can only obtain E[IR N (N a )] - A (d) = 0(TO _2d_1 ) under Assumption S(d,(3): 
the convergence rate could be slower than mT 2 ^ 1 and then q.n could converge to a' < a* (from the proof of 
Proposition^!. Condition 15.13]) and [5.14]) , which are not very strong, allow to obtain a first order bound 
for E[/i?Ar(iV Q )] — Ao(cZ) (see Proposition \5.2]) and hence to prove oln — > a*. 

JV-s-oo 

From a straightforward application of the proof of Proposition [2] the asymptotic behavior of on can be 
specified, that is, 

Pr ( tt~~aK\ < nSn < N °* ■ (log AO") — > li ( 3 - 2 ) 

V (log AT ~ ~ ' n^oo y ' 
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for all positive real numbers A and fi such that A > ( p „ 2 )(i-ct*) ano ~ P p^2' Consequently, the selected 
window mjy = N aN asymptotically growths as N a up to a logarithm factor. 

Finally, Proposition [2] can be used to define an adaptive estimator of d. First, define the straightfor- 
ward estimator d,N(N aN ), which should minimize the mean square error using Sat. However, the estima- 
tor d,N(N aN ) does not satisfy a CLT since Pr(S<Ar < a*) > and therefore it can not be asserted that 
E(ViV/iV a « (d N (N Sn ) - d)) = 0. To establish a CLT satisfied by an adaptive estimator of d, a (few) shifted 
sequence of Sat, so called 5 at, has to 

adaptive scale sequence (ttin) such as 



sequence of Sat, so called a at, has to be considered to ensure Pr(aAr < a*) — > 0. Hence, consider the 



5n „ ^ 6a N log log N 

m N := N N with a N := a N + — ^— • — ■ — . 

(p-2)(l-a N ) logN 

and the estimator 

d% R) :=d N (m N ) = d N (N s »). 
The following theorem provides the asymptotic behavior of the estimator d^ R ^ : 

Theorem 2. Under assumptions of Proposition^ 



\fw^(^ R) - d ) ^^(d)- 2 ^^ 1 ^^)- 1 ). (3.3) 



2(1 + 3/3) iVTW w Jfl) 
(p-2)/3 ' (logiV)p ' 1 N "l 



Moreover, if (3 < 2d + I, Vp > — ^ , Ann • |d£ j - d| — > 0. 



Remark 7. When (3 < 2d + 1, the adaptive estimator d N converges to d with a rate of convergence rate 
equal to the minimax rate of convergence N 1 + 2 ' 3 up to a logarithm factor (this result being classical within this 
semiparametric framework) . Thus there exists i < such that 

N^(logN) e E(d^ R) - df < oo. 

Therefore satisfies an oracle property for the considered semiparametric model. 

If ft > 2d + 1, the estimator is not rate optimal. However, simulations (see the following Section) will show 
that even if p > 2d + 1, the rate of convergence of d N can be better than the one of the best known rate 
optimal estimators (local Whittle or global log-periodogram estimators). 

Moreover an adaptive version of the previous goodness-of-fit test can be derived. Thus define 

f ( J R) := T n (N 5n ). (3.4) 

Then, 

Proposition 3. Under assumptions of Proposition^ 



4 Simulations and Monte-Carlo experiments 

In the sequel, the numerical properties (consistency, robustness, choice of the parameter p) of d N are 
investigated. Then the simulation results of d N are compared to those obtained with the best known 
semiparametric long-memory estimators. 

Remark 8. Note that all the softwares (in Matlab language) used in this Section are available with a free 
access on http://ssuiim.univ-parisl.fr/-Jean-Marc-Bardet. 
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To begin with, the simulation conditions have to be specified. The results are obtained from 100 generated 
independent samples of each process belonging to the following "benchmark". The concrete procedures of 
generation of these processes are obtained from the circulant matrix method, as detailed in Doukhan et al. 
(2003). The simulations are realized for different values of d, N and processes which satisfy Assumption 
S(d, (3): 

1. the fractional Gaussian noise (fGn) of parameter H = d + 1/2 (for —0.5 < d < 0.5) and a 2 = 1. Such a 
process is such that Assumption S(d, 2) holds; 

2. the FARIMA[p, d, q] process with parameter d such that d g (—0.5,0.5), the innovation variance a 2 
satisfying a 2 = 1 and p 7 q € N. A FARIMA[p, d, q] process is such that Assumption S(d, 2) holds; 

3. the Gaussian stationary process X^ d ^\ such as its spectral density is 

/3(A) = ^j(l + A^) for Ae[-7r,0)U(0,7r], (4.1) 

with d € (—0.5,0.5) and (3 € (0,oo). Therefore the spectral density fa is such as Assumption S(d,f3) 
holds. 

A "benchmark" which will be considered in the sequel consists of the following particular cases of these 
processes for d = -0.4, -0.2, 0, 0.2, 0.4: 

• fGn processes with parameters H = d + 1/2; 

• FARIMA[0, d, 0] processes with standard Gaussian innovations; 

• FARIMA[1, d, 1] processes with standard Gaussian innovations and AR coefficient <\> = —0.3 and MA 
coefficient 4> = 0.7; 

• X^ d '^ Gaussian processes with (3 = 1. 

4.1 Application of the IR estimator and tests applied to generated data 

Choice of the parameter p: This parameter is important to estimate the "beginning" of the linear part of 
the graph drawn by points (i, IR{im))i. On the one hand, if p is a too small a number (for instance p = 3), 
another small linear part of this graph (even before the "true" beginning N a ) may be chosen. On the other 
hand, if p is a too large a number (for instance p = 50 for N = 1000), the estimator oim will certainly satisfy 
5 at < a* since it will not be possible to consider p different windows larger than N a . Moreover, it is possible 
that a "good" choice of p depends on the "flatness" of the spectral density /, i.e. on (3. We have proceeded 
to simulations for several values of p (and N and d). Only \J MSE of estimators are presented. The results 
are specified in Tabled] 

Conclusions from Tabled it is clear that d^ R ^ converges to d for the four processes, the faster for fGn and 
FARIMA(0, d, 0). The optimal choice of p seems to depend on N for the four processes: p = 10 for N = 10 3 , 
p = 15 for N = 10 4 and p £ [15, 20] for TV = 10 5 . The flatness of the spectral density of the process does not 
seem to have any influence, as well as the value of d (result obtained in the detailed simulations). We will 
adopt in the sequel the choice p = [1.5 log (AT)] reflecting these results. At the contrary to the choice of m, 
this choice of p only depends on N and even if the adaptive scale rfiN depends on p its value does not change 
a lot when p E {10, • • • , 20} for 10 3 < N < 10 5 . 

Concerning the adaptive choice of m, the main point to be remarked is that the smoother the spectral density 
the smaller m; thus m^r is smaller for a trajectory of a fGn or a FARIMA(0, d, 0) than for a trajectory of a 
FARIMA(1, d, 1) or X^^K The choice of p does not appear to significantly affect the value of ffiN- More de- 
tailed results show that the larger d included in (—0.5, 0.5) the smaller ffiN- for instance, for the fGn, TV = 10 4 
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and p = 15, the mean of is respectively equal to 23.9, 8.3, 4.5, 4.2 and 3.8 for d respectively equal to 
—0.4, —0.2, 0, 0.2 and 0.4. This phenomena can be deduced from the theoretical study since a* = (4d + 3) _1 
in this case and therefore ffiN almost growths as JV' 4d+3 ) . 

Finally, concerning the goodness-of-fit test, we remark that it is too conservative for p = 5 or 10 but close to 
the expected results for p = 15 and 20, especially for FARIMA(1, d, 1) or X^^. 

Asymptotic distributions of the estimator and test: Figure Q] provides the density estimations of 
d% R) and f^ R) for 100 independent samples of FGN processes with d = 0.2 with N = 10 4 for p = 15. The 
goodness-of-fit to the theoretical asymptotic distributions (respectively Gaussian and chi-square) is satisfying. 



Estimated and asymptotical density of the adaptive estimator of d 




0.09 
0.08 
0.07 
0.06 
0.05 
0.04 
0.03 
0.02 
0.01 




Density {estimated and theoretical) of the adaptive goodness-of-fit test 




— Estimated density 

Chi2(14) density 
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Figure 1: Density estimations and corresponding theoretical densities of 5]y and T^ R ^ for 100 samples of 
fGn with d = 0.2 with N = 10 4 and p = 15. 



4.2 Comparison with other adaptive semiparametric estimator of the memory 
parameter 

Consistency of semiparametric estimators: Here we consider the previous "benchmark" and apply the 
estimator d N and 3 other semiparametric estimators of d known for their accuracies arc considered: 

• dhis is the adaptive global log-periodogram estimator introduced by Moulines and Soulier (1998, 2003), 
also called FEXP estimator, with bias-variance balance parameter n — 2; 

• da is the local Whittle estimator introduced by Robinson (1995). The trimming parameter is m — A/30; 

• dw is an adaptive wavelet based estimator introduced in Bardet et al. (2008) using a Lemarie-Meyer 
type wavelet (another similar choice could be the adaptive wavelet estimator introduced in Veitch et al., 
2003, using a Daubechie's wavelet, but its robustness property are quite less interesting). 

• d%f R ^ defined previously with p — [1.5 * log(A)]. 



• d]y(10) and g?at(30) which are the (univariate) IR estimator with m = 10 and m = 30 respectively, 
considered in Surgailis et al. (2007). 
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Simulation results are reported in Table [U 



Conclusions from Table [H - The adaptive IR estimator d N numerically shows a convincing convergence rate 
with respect to the other estimators. 

The estimators d/v(10) and djv(30) are clearly the worst estimators of d. This can be explained by two facts: 
1/ the numerical expression of the matrix £jv(m) is almost a diagonal matrix: therefore a least square re- 
gression using several window lengths provides better estimations than an estimator using only one window 
length; 2/ dj^ (10) and cZat(30) use a fixed window length (m = 10 and m = 30) for any process and N while 
we know that m ~ N a is the optimal choice which is approximated by ffiN- 

Both the "spectral" estimator da and dms provide more stable results that do not depend very much on 
d and the process, while the wavelet based estimator dw and djy are more sensible to the flatness of the 
spectral density. But, especially for "smooth processes" (fGn and FARIMA(0, d, 0)), dfS R ' is a very accurate 
semiparametric estimator and is globally more efficient than the other estimators. 



Robustness of the different semiparametric estimators: To conclude with the numerical properties of 
the estimators, five different processes not satisfying Assumption S(d, /3) are considered: 

• a FAR1MA(0, d, 0) process with innovations satisfying a uniform law; 

• a FAR1MA(0, d, 0) process with innovations satisfying a symmetric Burr distribution with cumulative 
distribution function F(x) — l — \ 14 L a .^ for x > and F(x) — 1 for x < (and therefore E|Aj| 2 = oo 
but E\Xi\ < oo); 

• a FAR1MA(0, d, 0) process with innovations satisfying a symmetric Burr distribution with cumulative 
distribution function F(x) = 1 — | m~p72 for x > and F(x) — 1 1+ | *p/2 for a; < (and therefore 
E|X,| 2 = oo but E\Xi\ < oo); 

• a Gaussian stationary process with a spectral density /(A) = ||A| — ir/2\~ 2d for all A € [— n,ir] \ 

vr/2}: this is a GARMA(0, d, 0) process. The local behavior of / in is /(|A|) - (7r/2)~ 2d |A|- 2d 
with d = 0, but the smoothness condition for / in Assumption S(0,j3) is not satisfied. 

• a trended fGn with parameter H = d + 0.5 and an additive linear trend; 

• a fGn (H = d + 0.5) with an additive linear trend and an additive sinusoidal seasonal component of 
period T = 12. 

The results of these simulations are given in Table |31 



Conclusions from Table \Bf The main advantage of dw and d N with respect to cIms and dn is exhib- 
ited in this table: they are robust with respect to smooth trends, especially in the case of long memory 
processes (d> 0). This has already been observed in Bruzaite and Vaiciulis (2008) for IR statistic (and even 
for certain discontinuous trends). Both those estimators are also robust with respect to seasonal component 
and this robustness would have been improved if we had chosen m (or scales) as a multiple of the period 
(which is generally known). 

The second good surprise of these simulations is that the adaptive IR estimator dYf^ is also consistent for 
non Gaussian distributions even if the function A in (|2.6|) and therefore all our results are typically obtained 
for Gaussian distributions. The case of finite- variance processes is not surprising (see Remark 2). But this is 
more surprising for infinite variance processes. A first explanation of this was given in Surgailis et al. (2007) in 
the case of i.i.d.r.v. in the domain of attraction of a stable law with index < a < 2 and skewness parameter 
— 1 < /3 < 1: they concluded that IRn(iti) converges to almost the same limit. The extension to a-stable 
linear processes of this first explanation should require technical developments but the expression of the IR 
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statistic (which is bounded in [0, 1] for any processes) could allow to apply it to infinite variance processes. 
Note that the other semiparametric estimators are also consistent in such frame with faster convergence rates 
notably for the local Whittle estimator. 



5 Proofs 

Proof of Property 1 2. li We proceed in two steps. 

Step 1: First, we compute the limit of ^ Cov(lRN(jm), IRN(j'm)) when N, m and N/m — > oo. As in 
Surgailis et al (2007), define also for all j = 1, • • • ,p and k = 1, • • • , N — 3m j (with rrij = jm): 



k+rrij k+rtij ^ 

W*0 : = v~ E ( X t+ m] -X t ) , with V*. :=E[( £ (X t+m .-X t )) ] (5.1) 



'3 * = 



t=fc+l t=k+l 



\Y m .(k) + Y m Ak + mj)\ 

and ^ (fc) ' = (5 - 2) 



Note that F mj (A;) ~ A/"(0, 1) for any A; and j and 

JV-3m,-l 

N - 3m,-' 



^ l\—imj—L 

IR N {mj) = — - ^ r\ mj (k) for all j = 1, • • -p. 



k=0 



Cov(IR N (m j ) ) IR N (m j *)) = E Cov(r) mj (k),r) m .,(k'))) 



N — 3m, N — 3m j 

J J k=0 k'=0 



(^7 ~ 3 )(— Jt=0 Jt'=0 



N—l n Af-1 



Cov(r] m] ([rrijT}), r) m ., ([m^r']))) rfr dr. 



Now according to (5.20) of the same article, with — >fdd denoting the finite distribution convergence when 
m — > oo, 



Y m ([mT}) — >fdd Z d (r) 



where Z d is defined in (|2.3|i . Now 



jm jm 

Yj m (k) = — — Xt+jm+i — y^^x t+ ix t ) 

Vm l t=l {=1 
1 

= 17- E (i-KI)Kn^ m (*+(i + i-l)m). 

But ~ cqV (d)m 2d+1 when m — ?> oo (see (2.20) in Surgailis et al, 2007). Therefore we obtain Yj m ([mjr]) ~ 
j d + 1/2 5Zi=— (j— 1)0' ~~ l*l)^m([ OT i' r ] + (i + ^ — l)m) when m — > oo (in distribution) and more generally, 

(y,- m ([mjr]), Y,-/ m ([mjV]) — S> f _d_d 

-L^ £ + E U'-li'DUfr'+f + i'-l)), (5.3) 

when m — > oo. Hence, obvious computations lead to define for (el 

Zjp{t) :=Y (j \i\)Z d (t+j+i i ) _B d+0 . 5 (t + 2j)-2B d+ o. 5 (t + 3) + B d+0 . 5 (t) 

7 ^" ) (i) := Cov(^\o),Z^\j))^(Z i /\t),Z d f) (t+f))). (5.5) 
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Now, as the function tp(x,y) = rfepra is a continuous (on R 2 \ {0,0}) and bounded function (with < 
i/)(x,y) < 1) and since ^([toj't]) = ip(Y m:j ([mjT]),Y mj ([mj(T + 1)])), then from ([O]), 



C0Y{ Vm] {[ m] T]), Vm3 ,{[m r r'])) Cov^(^' ) ( J r) ; ^ ) ( J (r + l)))^(zf ) (/r'),^' ) (/(r' + l)))) 

using the stationarity of the process Zd and therefore of processes Z d and Z d . Hence, when N, m and 

N/m — > oo, 

AT AT 
-Cov^Om),/^^™)) ~ - 3 )(^-3) 

N-l _ g N-l _ g 

x f *" [^Coy^(Z^{jr),Z^(jr + j))MZ d j ' } (f r'), Z ( /\f r' + f)))drdr' 

mN f~ — 03 f~ — 3j ' 1 ■ '\ 

' ' 7^"V -s)dsds' 



Jo 



(N-3jm)(N-3j'm) J J 

N_ 

ff/.j(s-i-i)-'? rt w* 

1 ( d "'\u)du=:a u ,(dy (5.6) 

This last limit is obtained, mutatis mutandis, from the relation (5.23) Surgailis et al (2007), and thus 
Id' 1 \ u ) — C {u~ 2 A 1), implying ^ f™ K \u\-f d J ' j \u) du — > 0. It achieves the hrst step of the 

N, rn, ~->-oo 

proof. 

Step 2: It remains to prove the multidimensional central limit theorem. Then consider a linear combi- 
nation of (Ji?jv(mj))i< :) <p, i.e. Sj=i u j -f-R/v( TO j) with (u\,--- ,u p ) € K p . For ease of notation, we will 
restrict our purpose to p = 2, with m,; = r^m where ri < r2 are fixed positive integers. Then with the 
previous notations and following the notations and results of Theorem 2.5 of Surgailis et al. (2007): 

u\ IRN{r\m) + U2 IRN{r 2 m) = u 1 (E[IR N (r 1 m)] + Sjf(rim) + Sjf (rim)) 

+ u 2 (E[IR N (r 2 m)} + S K {r 2 m) + S K {r 2 m)). 

From (5.31) of Surgailis et al. (2007), we have Sif(mi) = o(SK(mi)) and SK(jn 2 ) = 0(6*^(7712)) when 
K — > 00 and from an Hermitian decomposition (N/m) 1 ^ 2 (uiSK('mi) + u 2 Sk{">tl2)) -^d N(0,Jk) as N, m 
and N/m — > 00 since the cumulants of (N /m) 1 l 2 {u\SK{' , m-i) + u 2 SK{'m-2)) of order greater or equal to 3 
converge to (since this result is proved for each 5r<(mi)). Moreover, from the previous computations, 
7jL — > (u 2 a ritri (d) + 2uiu 2 a rit r 2 (d) + U2<7 r2t r 2 (d)) when K — > 00. Therefore the multidimensional central limit 
theorem is established. □ 

Property 5.1. Let X satisfy Assumption S(d,f3) with —0.5 < d < 0.5 and /? > 0. Then, there exists a 
constant K(d, j3) < depending only on d and /3 such as 
E[lR N (m)] = A (d) + K(d,f3)xm-P + 0(m-P- £ + m- 2d - 1 \og(m)) if-2d+/3<l, 

= A (d) + K(d,f3) x m-V log(m) + 0(m- p ) if-2d+/3 = l; 

= A {d) + O(m- 2d - 1 ) if-2d + [3>l. 

Proof of Provertv \5.1[ As in Surgailis et al (2007), we can write: 
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Therefore an expansion of i? m /V^ will provide an expansion of E[/i?/v( m )] when m — > oo and the multidi- 
mensional CLT (12.81) will be deduced from Delta-method. 



Step 1 Let / satisfy Assumption S(d,f3). Then we are going to establish that there exist positive real 
numbers C\ and C2 specified in (|5.7j) and (|5.8[) and such that: 

p(d) + d{-2d,l3) m-P + 0{m- l3 - £ + m- 2d - 1 \ogm) 

p(d) + C 2 (l-j3,j3) m-' 3 log m + O(m- ) 

p(d) + Oim- 2 *- 1 ) . 



1. if-l< -2d< 1 and -2d + < 1, 

2. if-l< -2d< 1 and -2d + = 1, 

3. if -1< -2d < 1 and -2d + (3 > 1, 



m 

v 2 

m 
R"m 

v 2 



Indeed under Assumption S(d,(3) and with Jj(a,rn), j — 4,6, defined in (|5.23j) of Lemma [STTI (see below), it 

IS clGcLr tjlitltj 

J 6 (-2d, m) + I J 6 (-2d + f3,m)+ Q(J 6 (-2d + g + e)) 
V;2 ~ J 4 (-2d,rn) + ^J 4 (-2d + £,m) +0(J 4 (-2d + /3 + e))' 

since / 0(x" 2d+/3+e ) ^-^-dx = 0{JA-2d+ p+s)) for j = 4, 6. Now wc follow the results of LemmaO 

Jo sin (f ) 

1. Let -1 < -2d + < 1. Then for any e > 0, 

#m . „ C 6 i(-2d)m 1 + 2d +C 62 (-2d)+^(C 6 i(-2d + /3)TO 1 + 2d -^+C 62 (-2d + /3))+0(m 1 + 2d -' 3 - £ +logm) 



= 1-2 



C41 (-2d)™ 1 + 2d +C 4 2 (~2d)+f - (C 4 i (-2d + /3)m 1 + 2d -' 9 +C , 4 2 (-2d + j 8))+0(m 1 + M -* , - < 4-log m) 



1- 



C 41 (-2d) 



C 6 i (-2d)+— C 6 i (-2d + z?)™-' 3 
co 



= i 2C 61 (-2d) ci [■ C 61 (-2d)C 41 (-2d + /3) C ei {-2d + 0) 



, Cl C 41 (-2d + /3) 

1 771 " 

c C 41 (-2d) 



0(7 



-0(m _/3 ~ e +m~ 



2d-l 



log m) 



C 4 i(-2d) c Q I C 4 i(-2d)C* 4 i(-2d) C 4 i(-2d) 
As a consequence, with p(d) defined in (|2.7p and Cji defined in Lemma |5.1 



loe 



i? 



-^-=p{d) + d(-2d,P) m~P+ 0[m-P- £ + 



V 2 



-2d-l - 



logm) (m — > 00), with 



d(-2d,/3) :=2 



ci 1 
c Cl 1 (-2d) 



[Cei (-2d)C 4 i (-2d + p) - C 6 i (-2d + /3)C 4 i (-2d)] , (5.7) 



and numerical experiments proves that C\ (— 2d, f3)/c\ is negative for any d € (—0.5,0.5) and /3 > 0. 
2. Let -2d + fi = 1. 
Again with Lemma l5.1[ 



•0(r 



= {C 61 (-2d)mP + C^logjjnn) + C 62 (-2d) + f^ 2 + 0(1)] 

V 2 [C 41 (-2d)m^ + Ck£fo fl (""0 + C 42 (-2d) + fC' i2 + 0(1)] 



= 1 - 
= 1 - 
As a consequence, 



1 ■ d,_„/„^„-e 1ri / C41 Cl iog(m))m-' 3 ] 



co ^42 
^41 



[C 61 (-2d) + (0 61 ^log(m))m^][l-( 
041(0; c o 4 i(a; c 

o 61 (-2d) - ^(%h^gk _ ai) log(m) m -,i + 0(m -^ 



C 4 i(-2d) 



c v On (-2d) 



- p(d) + C 2 (-2d,l3)m~ p logm + O^i" 13 ) (m ->■ 00), with 

* m 

C 2 (-2d,/3) := 2 ^ (C^g a i(-2d) - 6^641 (-2d)), (5.8) 
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and numerical experiments proves that C2(— 2d, /3)/cj is negative for any d £ (—0.5,0.5) and fi = 1 — 2d. 

3. Let -2d + (3 > 1. 

Once again with Lemma 15.11 

Rrn 9 [C sl (-2d)m 1 + 2d + C 62 (-2d) + fC^{-2d + (3) + %G'^-2d + f3)m^^ + 0(1)] 

V 2 



2 [C 81 (-2d) + O^- 2 ^ 1 )] [1 - Ofa- 2 *- 1 )] 



On (-2d) 
2C 6 i(-2d) 



01 



-2d-l\ 



C 41 (-2d) 

Note that it is not possible to specify the second order term of this expansion as in both the previous cases. 
As a consequence, 



01 



-2d-l 



) (m — > oo). 



Step 2: A Taylor expansion of A(-) around p(d) provides: 

r9A 



A (f?) 



A(p(d)) 



dp 



(5.9) 



(5.10) 



Note that numerical experiments show that 



<9A 



dp 



(p) > 0.2 for any p e (—1, 1). As a consequence, using the 



previous expansions of Rm/V^ obtained in Step 1 and since E[/i?/v(m)] = K.(R m /Vfy , then 



ciC((d,/3)m- 



-P-e . 



-2d-l 



log m + m 



E[lR N (m)] = A (d) + < ci C 2 (/3)m-' 3 logm + O(m^) 



0(r 



-2d-l"\ 



if (3 < 1 + 2d 
if j8 = 1 + 2d 
if /3 > 1 + 2d 



with C[{d,(3) < for all d e (-0.5, 0.5) and f3 € (0, 1 + 2d) and C^(/3) < for all < (3 < 2. 



□ 



Proof of Theorem Using Property O if m ~ C A Q with C > and (1 + 2/3)" 1 A (4d + 3)" 1 < a < 1 then 
^/ NJ m (E [JiZjy — Ao(d)) — >• and it implies that the multidimensional CLT <\2A\ can be replaced 

by 



N- 



M(lR N (r 
V m V 



A (d) 



l<j<p AM-oo 



AT(0,r p (d)). 



(5.11) 



It remains to apply the Delta-method with the function Aq" 1 to CLT (|5.11[) . This is possible since the 
function d — > Ao(d) is an increasing function such that A' (d) > and (Aq 1 )'(Ao(d)) = 1/A' (d) > for all 
de (—0.5, 0.5). It achieves the proof of Theorem [TJ □ 

Proof of Proposition^ For ease of writing we will note Ejv instead of Yliy(N a ) in the sequel. We have 
(djv(m) — djv(j to)) 1< •< = Mjv(div(j to) — d) 1< •< with A/at the orthogonal (for the Euclidian norm || • ||g w ) 
projector matrix on ((l)i<i<p) (which is a linear subspace with dimension p — 1 included in K p ) in K p , i.e. 



= Jp(JpS Ar 1 J p ) 1 JpE^y 1 . Now, by denoting a symmetric matrix such as T,^ £ 



a/2 v i/2 



\\(d N (m) - d N {j m)) x< ,< ||| = {d N {jm) - d)' x< M N S N 1 M N (d N (j m) - d) 1 



<i<p 
{A N Z N )'{A N Z N ) 



<j<p 



with A^r = Ejy^M/vI]^ 2 and Z/v a random vector such as ^/ N jmZjq 



U/2 



JV- 



Af p (Q,Ip) from Theorem ED 



But we also have An — Jp(J p Yjj^ Jp) 1 J p Y* 



Hn(H' n Hn) 1 H' n with i/jv = Yj^^Jp a matrix 



AT 
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of size {p x (p — 1)) with rank p — 1 (since the rank of J p is (p — 1)). Hence is an orthogonal projec- 
tor to the linear subspace of dimension p — 1 generated by the matrix Hjy. Now using Cochran Theorem 
(see for instance Anderson and Styan, 1982), */ N/m An Zm is asymptotically a Gaussian vector such as 
N/m(A N Z N )'(A N Z N ) A x 2 (;p-l). □ 



In Property [CT] a second order expansion of E[/i?Ar(m)] can not be specified in the case ft > 2d + 1. In the 
following Property 15.21 we show some inequalities satisfied by E[/i?jv(™)] which will be useful for obtaining 
the consistency of the adaptive estimator in this case. 

Property 5.2. Let X satisfy Assumption S(d,f3) with —0.5 < d < 0.5, ft > 1 + 2c?. Moreover, suppose that 
the spectral density of X satisfies Condition &5.13\) or \5. 1J$ . Then there exists a constant L > depending 
only on Co, Ci, C2, d, ft, £ such that 

\E[lR N (m)] - A (d)\ > Lm- 2d -\ (5.12) 

Proof of Provertv \5.S\ Using the expansion of Jj(a,m), j — 4,6, for a > 1 (see Lemma 15. ip and the same 
computations than in Propertv l5.il we obtain: 

- r<2 , 2 ... [(C 62 (-2d)C 41 (-2d)-C 42 (-2d)<7 6 i(-2d))+-(^ 
o 41 (— 2a) l co 

+ — (CS'i(-2d + /3 + e)C 4 i(-2d) + C^(-2d + /? + e)C , 6 i(-2d))l m _!W - 1 (l + o(l)) 
Co J 

<W"P(«0< 

(C 62 (-2d)C 4 i (-2d) - C 42 (-2d)G>i (-2d)) + - (C& (-2d+ft)Cu (-2d) - C 41 (-2d+£)C 61 (-2d)) 

co 

l C 2| /W/ / r> I , /I , / n.n , sill / r, j . n , . \ ^ / o A \ 1 — 2d— 1 / 



C| 1 (-2d) 

Now, denote 
D (d) : 



co 



• (C|Ji(-2d + /? + £)C 4 i(-2d) + C 4 ' 1 (-2d + /3 + e)0,i(-2d)) m" 2 ^! + 



C 62 (-2d)C 41 (-2d) - C 42 (-2d)C 61 (-2d) = ^/^iL)^ 5 - 32+M )' 

^(d,/3) := C 62 (-2d + ft)C A1 (-2d) - C 42 (-2d + ft)C 61 (-2d) = ^^±^^ ( 2 4 + 2rf - 5 - 3 2 + 
D 2 (d,ft,e) := ^' 1 (-2d + /3 + e)C 4 i(-2d) + C 4 ' 1 (-2d + /3 + e)C 61 (-2d). 

Since -0.5 < d < 0.5, 2 4+2d - 5 - 3 2+2d > and 1 - 2- 1+2d > 0. Moreover, from the sign of the constants 
presented in Lemma I5TT1 we have D (d) ^ except for d = 0, D\(d, ft) ^ except for d = 2ft and D 2 (d, ft, e) > 
for all d € (—0.5, 0.5), /3 > and e > 0. Therefore, if cq, c\,c 2 ,d, ft, e are such that 

K x := D (d) + — Si(d,j8)-^D 2 (d,j9,e)>0 (5.13) 
Co c 

or JT 2 := D Q (d) + -D 1 (d,ft) + ^D 2 (d,ft,e) <0. (5.14) 
co c 

and from the signs of Do(d), D\(d,ft) and D 2 (d, ft,e), when (d,ft,e) is fixed, these conditions are not im- 
possible but hold following the values of ^ and Then — p(d) < — 2 1 TO~ 2d ~ 1 or — j- — 



p(d) > — 2 — 7TK m f° r m large enough following f|5 . 13[) or f|5 . 14[) holds. Then, if (|5.13[) holds, since 

C^iv 2d) 

E[I.R;v(to)] = A(-^f-), since the function r — > A(r) is an increasing and C 1 function and since E[/i2jv(m)] = 
R 

A( — ^) then when m large enough, from a Taylor expansion, 



E[Ji^(m)] < A(p(d) - m" 2 "" 1 ) E[/i? A r(m)] < A (d) - ^A>(d)) A 1 



C| 1 (-2d) 7 2 ^ w ^C 2 1 (-2d) 



■m- M - 1 . 
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Now following the same process if (|5.14p holds, we deduce inequality (|5.12[) . □ 
Proof of Proposition^ Let e > be a fixed positive real number, such that a* + e < 1. 

I. First, a bound of Pr(5jv < a* + e) is provided. Indeed, 

Pr(a N <a*+s) > Pv[Q N (a* + e/2) < min Qn{&) ) 

^ a>a*+e and a<=A N ' 

> 1-Pr( |J Q N {a* +e/2)>Q N { 

a > a *+e and 
log [N/p] 



k=[(a*+e)lo S N] 

But, for a > a* + e, 



; I- J] Pr(Q w (a*+ £ /2)>Q i v( i ^)). (5.15) 



Pr ^ 

Pr 



■ ■ 2 

'l<i<pllsjv(W a ) 



(QAr(a*+e/2)>Qjv(a)) 

(||(d Ar ( l iV Q *+^)) 1 ^ p -^v(iV«*+^ 2 )||| jv(JVQ , +e/2) > \\(d N (iN a )-d N (N a )) } 

with = X'Q-'X. Set Z*(a) = ^ || (d N (i N*))^.^ - d N (N<*)\\l N(Nay Then, 

Pr(Ow(a* + e /2) > Qiv(a)) = Pr (Zjy(a* + e/2) > jv«-(<**+*/2) Z N {a)^j 

< Pr (z^a* + e/2) > iv( Q -( Q *+ £ / 2 ))/ 2 ) + Pr (z N (a) < N -^-^+e/2))/2^ 

From Proposition [TJ for all a > a*, Zat(o:) — — > x 2 (p — 1). As a consequence, for N large enough, 

N-Hx> 



Pr 



rfZvfa) < Ar-(«-(«*+^/2))/2 N ) < ? ^ } (a - (Q . +e/2)) 



2(p- 1 )/ 2 r((p- i)/2) 

Moreover, from Markov inequality and with TV large enough, 

Pr (z N (a* + e/2) > N ^-( a '+e/2))/2^ < 2 Pr (exp( y/ x 2 (j> - 1) > exp (jv(a-(«*-W2))/^ 

< 2 E(exp(Vx 2 b- 1)) cxp ( - N^^'+em)/^ 
We deduce that there exists Mi > not depending on N, such that for large enough N, 
Pr (Qn{oi* + e/2) > Q«(a)) < M x exp ( - jv( Q ~( a * +£ / 2 »/ 4 ) . 



since E(exp( Ak /x 2 (P — 1)) < 00 does not depend on N. Thus, the inequality (|5.15|) becomes, with M 2 > and 
for N large enough, 

log[N/p] -[(a*+e) log AT] 

Pr(aAr<a*+e) > 1 - Mi e' N '' & ^ exp(-7V^t«) 



fc=0 



r/8 



> 1 - M 2 e~ N . (5.16) 

II. Secondly, a bound of Px(aN > a* — e) can also be computed. Following the previous arguments and 
notations, 

Pr(a N >a*-e) > Pr (Q N (a* + ^-J-e) < min Qjv(a)) 

[( Q *- £ )logiV]+l , 

> i- E Pr ^^ + ^ £ )>^(waf) ' ( 5 - 17 ) 
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and as above, with Z N (a) = \\(d N (iN a ) - dN(N a )) 1<i<p \\^, Nct y 

Pr (Q N (a* + i^e) > Q N (aj) = Pr (z N (a* + ^-e) > N a ^ a ' + 1 ^^ Z N (a) 



2a* 



(5.18) 



1 - a* 
2a* 

if /3 < 2d+ 1, with a < a* = (1 + 2/3)" 1 , from Property IO and with C ^ 0, for 1 < i < p, 
J - (E[lR(iN a )] -A (d)) ~ C i ^ (1 ^ Q * )/2Q * ^("'-"'/^'(logTV) 1 ^ 2 ^ 1 

^ (Aj7 1 (E[ZJZ(t.tf a )]) -d) ~ c'r^ 1 - *'/ 2 "* ArK-^/^^iogiV)^^ 1 (5.19) 
with C ^ 0, since A (d) > for all d G (-0.5,0.5). We deduce: 




(d^AT) -d)^ ^ =s (7''A^ (Q *^ Q)/2a *(logA^) 1 ^= M + 1 (r( 1 ^ a *^ 2Q *) 1<i<p + (eAr^A^")) 



i<*<p' 



with C" ^ and (ejv(« TV")) A/" 0, (A^(d))~ 2 r p (d) from Proposition [T] Now from the definition 

of d N (N a ), we have (djy(i N a )-d N (N a )) 1<i<p = M^v (5jv(« A^ Q ) -d) 1<i<p with M N the orthogonal projector 
matrix on (l)] L < i < p . 

As a consequence, for a < a* — e and with the inequality ||a — 6|| 2 > ^||a|| 2 — ||6|| 2 , 



Z N {a) > ~(C") 2 N^ (log 2 N) 1 ?^ Mvii-^)^ 



£ N (^)- ||M ^ (liVQ))ll L(-)- 



Now, it is clear that \\M N e N (i N a ))\\^ . < \\e N (i ^ a ))ll| N(ArQ) < Ci when ^ lar ge enough, with C\ > 
not depending on AT. Moreover the vector (i~ 2 Q * ) 1< . < is not in the subspace (l)i<j< p and therefore 



l<i<p 



> C2 for A^ large enough with C2 > 0. We deduce that there exists D > such 



that for N large enough and a < a* — e, 

Z N (a) > D N 2 ^^ (log 2 N) lfi = 2d+1 . 



Therefore, since N 



AT->oo 



00 when a < a* — e, 



y'y{ Z N (a) > ^DN^ 



1. 



Then, the relation (|5.18[) becomes for a < a* — e and A~ large enough, 

I -a* 



Pr 



(Qn( 



a + 



2a* 



-e)>Q N (a)) < Pr 



) < Pr( X 2 b-l)> (^DN^) 



2 
D 



N 



< Pr 

with M 2 > 0, because ^=^-(2(a* — a) — s) > 1 9a "* g for all a < a* — e. Hence, from the inequality (|5.17[) . for 
large enough N, 

Pr(a N >a*~e)>l~M 2 \ogN N-^-^ 1 ^ E . (5.20) 
• if p > 2d+ 1, with a < a* = (4d + 3)- 1 and from Property O we obtain an inequality instead of (15.191) : 



A^(E[IR N (m)}) - d > -(AoCrf))- 1 ^! 



-2d-l 



since the function x n- A 1 (x) is an increasing an C 1 function, using a Taylor expansion. Therefore for 
1 < i < P, 



A^ 1 (E [lR(i AT")] ) -d > i(A (d))^ 1 Lr( 1 ^ Q *)/ 2a * atK-°0/ 2 «\ 



(5.21) 
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Now, as previously and with the same notation, 

(d N (iN a ) - d*(J\T ))!<,•<,, - M n (A^(E[lR(iN a )]) - d) i<i<p + M^e^iN*))^. (5.22) 

Now plugging (|5.2ip in (|5.22[) and following the same steps of the proof in the case j3 < 2d+ 1, the same kind 
of bound (|5.20p can be obtained. 

Finally, the inequalities (fSTTrJf and (|530|) imply that Pr (\a N - a*\ > e) — ► 0. □ 

TV— foo 

Proof of Theorem^ The results of Theorem [5] can be easily deduced from Theorem[T]and Proposition [5] (and 
its proof) by using conditional probabilities. □ 

Proof of Proposition [Sj Proposition [3] can be deduced from Theorem [2] using the same kind of proof than in 
Proposition [T] and conditional distributions. □ 

Lemma 5.1. For j = 4,6, denote 

Ji{a,m):= x a , 2 , 2 ' dx. (5.23) 
Jo sm z (|) 

Then, we have the following expansion when m oo: 

1. if-l<a<l, J 3 (a,m) = C jl (a)m 1 - a + C :i 2(a) + O(m- 1 -^ aA0 '>): 

2. ifa= 1, Jj(a,m) = C' jX log(m) + C' j2 + 0(m _1 ); 

3. ifa>l, J 3 (a,m) = C'jiia) + 0(m 1 - a + m- 2 ), 

where constants Cji(a), Cj2(a), Ci-yia), Cj 2 {a) and C'^a) are specified in the following proof. 
Proof of Lemma \5.1[ 1. let —1 < a < 1. 

We begin with the expansion of Ji(a,m). First, decompose Ji(a,m) as follows 

Ma, m) = 2 a+1 f 2 y a sm 4 (my) [— Jtt ~ "4l d V + / 7^2 sm\^)dx. (5.24) 
Jo L sm (y) y z i Jo (2) 2 

Using integrations by parts and sin (|) = sin 2 (|) — isin 2 (ir) = |(3 — 4cos(y) + cos(2y)), we obtain for 

m — > 00: 

rW 8 " 4 ^^ = ^--((1-^) ^^^dy-lj^j^^-i^+^y^dy 

^(1-2^) _l-a o 1 



(1 - a)r(l - a) sinfc^) 2(1 - a) ' 



+ 0(m" 1 ) 



2 

where the left right side term of the last relation is obtained by integration by parts and the left side term is 
deduced from the following relation (see Doukhan et al. 2003, p. 31) 



1 7T 

y a sm(y) dy = - — for < a < 2. (5.25) 
2 r(a)sm(7r(f )) 

Moreover, with the linearization of sin 4 u and Taylor expansions ^—k-rr — \ ~ h and 1 _ cos fa) ^ JL 

' J * sai 2 (y) y 2 a _j. 3 y 3 sm s (y) y ^ Q 15' 

a+1 



->a+l 



y«sm 4 (my)\-l— -±]dy= 3^— / 2/ a [— ^ ■ - \]dy + Ofa" 1 -^) . (5.26) 

LsuT(y) y 2 i 8 J sm z (y) y 2 

Finally, by replacing this expansion in (|5.24p . one deduces 
f w sin 4 ( — ) 

J 4 (a,m)=/ x a ^-^dx^C 41 (a)m 1 - a + C 4 2(a) + O(m- 1 - ( - aA0 '>) (m -> oo), with 

Jo sin (§) 

Gu(o):= 7T( - 1 ~ and C 42 (a) := /V[^ \]dy - , 3 r Tr"" 1 . (5.27) 

W (l-a)r(l-a)sin((i^>) W ./„ [ sm 2 (y) y 2j 2(1 - a) 1 ' 
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Note that C 41 (a) > and C 42 (a) < for all < a < 1, C 42 (a) > for all -1 < a < 0, C 42 (0) = 0. 



A similar expansion procedure of Je(a, m) with sin 6 (^) instead of sin 4 (^) can be provided. As previ- 
ously with sin 6 (|) = ^(10 — 15cos(y) + 6cos(2y) — cos(3y)), when m — > oo, 

Ma,m) = C 6 i(a) m 1 "" + C 62 (a) + 0{m~ 1 -^) , 

7r(15 + 3 1_a — 2 1 ~ a 6) 5 

with C 6 i(o) := , . 7^7- tt and C 62 (a) := t C 42 (a). 

16(1 — a)l (1 — a) sin(^(l — a)) 6 

Moreover it is clear that Cei(a) > 0. 

2. let a = 1. 

When m^cowe obtain the following expansion: 

rzsin 4 (2f) w i-™ sin(2a; ) _ 2a; (""sinW-i n ft . 4 / 1 In 

/ . 2 ) x \ dx = - / v y cte-4/ — ^ dx) + 4 y sin 4 my -37- - — )dj 

Jo sin 2 (f) 2V7 2x 2 7 x 2 / 7 v sm 2 (y) y 2 / 

But, 

sin(2x) - 2x , „ /" m7r sin(a;) - x , 3/ . . f°° siny , f 1 siny - y \ _ u 

Moreover from previous computations (see the case a < 1), 

/ ysin 4 (my)( — -j— - — )dy = - / y( , „ - - — )dy + Ofm' 1 ). 
7 V sm 2 (y) y 2 / 8 7 V S m 2 (y) y 2 / 

As a consequence, when m — > 00, 

f n xsin 4 !' — ) 3 
/ . 2 ' dx = C 'n lo S( m ) + C 42 + Oim- 1 ), with C 41 := - and 
Jo sin (2) 2 

Note that C 41 > and C 42 ~ 2.34 > 0. 

In the same way , we obtain the following expansions when m — > 00, 

XSm 2 ! - g 6i log(m) + C£ 2 + 0{m- 1 ) with CT 61 :=^ and 

C£ 2 := -\og{Tt)+- [' y(-l—-L)dy+l [ ±(-cos(3y)+6cos(2y)-15cos(y))dy+4 f I s in 6 (|)dy. 
4 4 7 Vsm 2 (y) y 2 / 8 J 1 y\ 1 J y 2 

Note again that C' 61 > and numerical experiments show that C 62 > 0. 

3. Let a > 1. Then, with the linearization of sin 4 (u), 
r« z°sin 4 (2f) 3 r x a 1 <"* - a 



r/sin"^), 3 r x a , 1 r a; a . . , 1 r x a . . 

/ = — - — ax = - — r; dx / — = cos mi ffliH — / — = cos(2mx)ax 

Jo sin 2 (|) 8j sin 2 (|) 2 J sin 2 (f) V ' 8 J Q sin 2 (|) V ' 

^11 , % 1 /sin(mx) sin(2mx)\/ , ,N 

= C 4i («) + -/ ( 2 ^-^J^g^ + M^j^, (5-28) 



with: 3(1) = (-^1 - 4ax a -A - 8x a ^) and h(x) = (4a 8)x^. 

Nsin (|) / V sin (|) / 

First, if 1 < a, with an integration by parts, 

1 ,sin(ma;) sin(2mx) N , , . , n 9N 

-J i^- 1 \^)h(x)dx = Oim^ + m- 2 ). (5.29) 
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Moreover, 

1 f* fsm(mx) sin(2mx)~ 



m 



I (—2 16—)^ 



1 (— l) m w 9 „\ „ <t 1 1 /" w / cosfmx) cos(2tox) n .. , , 
' -)( a ^-4a + 8)7r a - 3 — -— / ( ^+ \ n ) )g'(x)dx 



v 32 2 /v 7 m 2 m 2 J y 2 32 

since 5 (x) - f x^ 1 and g'(ar) - ^ilx"- 2 . Therefore, if 1 < a, 

1 r (smlmx) sin(2mx)\ , _ 2 > 

(-L-^ ^) 9 { X )dx = 0(m 2 ). 

In conclusion, for 1 < a we deduce, 

/ 27^ = C-4i(a) + + m-i) with C^a) := | / -^777^ > °- 

Jo sin (§) 8 Jo sin (§) 

Similarly, for 1 < a we deduce, 

/ ■ 2 ,;, 2 - <%(a) + OK- + m- 2 ) with C^(a) := ^ / = £ C^a) > 0. 

Jo sm z (|) 16 Jo sm z (|) 6 

□ 
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Model 


Estimates 


p — 5 


p = 10 


p = 15 


p = 20 


fGn (H = d + 1/2) 


VMSE d}£ H) 


0.088* 


0.094 


0.101 


0.111 




mean(mjv) 


11.8 


12.5 


16.0 


19.4 




proba 


0.93 


0.89 


0.86 


0.85 


FARIMA(0, d, 0) 


VMSE d!£ H) 


0.112 


0.099 


0.094* 


0.107 




mcan(m]v) 


13.9 


12.5 


14.6 


17.9 




proba 


0.94 


0.92 


0.88 


0.86 


FAR.IMA(1, d, 1) 


VMSE a% H) 


0.141 


0.136* 


0.140 


0.149 




mcanfmiv) 


15.2 


15.0 


18.2 


21.1 




proba 


0.94 


0.89 


0.86 


0.82 




VMSE d)$ H> 


0.122 


0.112* 


0.121 


0.123 




mcan(m w ) 


14.1 


13.8 


16.2 


20.0 




proba 


0.91 


0.90 


0.87 


0.85 



Model 


Estimates 


p — 5 


p = 10 


p = 15 


p = 20 


fGn (H = d+ 1/2) 


VMSE d% H> 


0.030 


0.022 


0.019 


0.018* 




mcan(mjv) 


13.7 


10.3 


9.4 


8.9 




proba 


0.95 


0.89 


0.87 


0.84 


FARIMA(0,d,0) 


VMSE <2^ H) 


0.039 


0.034 


0.033 


0.031* 




mean(mjv) 


11.5 


9.0 


8.0 


7.2 




proba 


0.95 


0.90 


0.88 


0.82 


FARIMA(1, d, 1) 


VMSE dV H) 


0.067 


0.062 


0.061* 


0.061* 




mean(mjv) 


18.1 


15.9 


13.8 


13.3 




proba 


0.95 


0.90 


0.84 


0.78 


x (d -e\ p = l 


VMSE a% H> 


0.071 


0.068 


0.067* 


0.071 




mean(mjv) 


15.2 


13.6 


11.7 


10.9 




proba 


0.92 


0.88 


0.85 


0.80 



Model 


Estimates 


p — 5 


p = 10 


p — 15 


p = 20 


fGn (H = d+ 1/2) 


VMSE di U > 


0.012 


0.008 


0.007 


0.006* 




mean (m at) 


14.0 


9.8 


6.9 


7.9 




proba 


0.92 


0.90 


0.87 


0.85 


FARIMA(0,d,0) 


VMSE d^ H) 


0.021 


0.019* 


0.019* 


0.019* 




mean(mjv) 


15.8 


12.7 


11.1 


9.8 




proba 


0.96 


0.94 


0.92 


0.89 


FARIMA(1, d, 1) 


VMSE d}£ H) 


0.039 


0.037 


0.035* 


0.035* 




mean (m at) 


25.7 


21.8 


21.4 


20.4 




proba 


0.98 


0.98 


0.94 


0.93 


X W>, 13 = 1 


VMSE d}£ H) 


0.042 


0.042 


0.040* 


0.041 




mcan(miv) 


22.3 


19.9 


19.7 


16.9 




pToba 


0.99 


0.97 


0.93 


0.90 



Table 1: \JMSE of the estimator , sample mean of the estimator and sample frequency that Tjv < 
g x 2( p _ 1 )(0.95) following p from simulations of the different processes of the benchmark. For each value of N 
(10 3 , 10 4 and 10 5 ), of d (-OA, -0.2, 0, 0.2 and OA) and p (5, 10, 15, 20), 100 independent samples of each 
process are generated. The values \J~MSE d^ R \ meanffhpf) and proba are obtained from sample mean on 
the different values of d. 
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N = 10 3 



Model 


V MSE 


d — —0.4 


d — —0.2 


d — 


d — 0.2 


d — 0.4 


fGn (H = d + 1/2) 


VMSE d MS 


0.102 


0.088 


0.094 * 


0.095 


0.098 




VMSE d R 


0.091 


0.108 


0.106 


0.117 


0.090 




\/MSE d w 


0.215 


0.103 


0.078 


0.073* 


0.061* 




VMSE d^ R) 


0.074* 


0.087* 


0.102 


0.084 


0.110 




VMSE d N (W) 


0.096 


0.135 


0.154 


0.158 


0.154 




VMSE d N (30) 


0.112 


0.192 


0.246 


0.270 


0.252 


FARIMA(0, d, 0) 


VMSB <2 M s 


0.096 


0.096 


0.098 


0.096 


0.093 




VMSE d R 


0.094 


0.113 


0.107 


0.112 


0.084 




VMSE d w 


0.069* 


0.073* 


0.074* 


0.082* 


0.085* 




VMSE <?^ B) 


0.116 


0.085 


0.103 


0.094 


0.101 




VMSE d N (10) 


0.139 


0.133 


0.148 


0.146 


0.156 




V MSE d N (30) 


0.157 


0.209 


0.232 


0.247 


0.243 


FARIMA(1, d, 1) 


VMSE d M s 


0.098 


0.092* 


0.089* 


0.088* 


0.094 




VMSE d R 


0.093* 


0.110 


0.115 


0.110 


0.089* 




\/MSE d w 


0.108 


0.120 


0.113 


0.117 


0.095 




VMSE d^ B) 


0.153 


0.131 


0.135 


0.138 


0.123 




VMSE d N (10) 


0.212 


0.188 


0.173 


0.157 


0.155 




VMSE d N (30) 


0.197 


0.228 


0.250 


0.265 


0.280 


X (D - D '\ D' = 1 


VMSE d MS 


0.092 


0.089* 


0.113* 


0.107* 


0.100* 




VMSE d R 


0.093 


0.111 


0.129 


0.124 


0.111 




VMSE d w 


0.217 


0.209 


0.211 


0.201 


0.189 




VMSE <?^ fl) 


0.075* 


0.101 


0.121 


0.122 


0.131 




VMSE d N (W) 


0.109 


0.143 


0.163 


0.168 


0.180 




VMSE djv(30) 


0.109 


0.177 


0.228 


0.249 


0.247 



N = 10 4 



Model 


VMSE 


d = -0.4 


d - -0.2 


d = 


d = 0.2 


d = 0.4 


fGn (H = d+ 1/2) 


VMSE d MS 
VMSE d R 
VMSE d w 
VMSE d^ R} 
VMSE SW(10) 
VMSE 5w(30) 


0.040 
0.040 
0.129 
0.019* 

0.036 
0.043 


0.031 
0.027 
0.045 
0.019* 
0.038 
0.070 


0.032 
0.029 
0.026 
0.017* 

0.049 
0.086 


0.035 
0.031 
0.022 
0.016* 

0.043 
0.081 


0.035 
0.030 
0.020 
0.019* 

0.048 
0.076 


FARIMA(0, d, 0) 


VMSE d MS 
VMSE d R 
VMSE d w 
VMSE d^ R} 
VMSE d N (W) 
VMSE d N (30) 


0.036 
0.031 
0.020* 
0.066 
0.076 
0.074 


0.030 
0.028 
0.018* 

0.031 
0.047 
0.085 


0.031 
0.027 
0.023 
0.018* 

0.043 
0.073 


0.035 
0.029 
0.025 
0.020* 
0.053 
0.086 


0.032 
0.029 
0.028* 
0.028* 

0.038 
0.073 


FARIMA(l,d, 1) 


VMSE d M s 
VMSE d R 
VMSE dw 
VMSE d^ R} 
VMSE djv(10) 
VMSE d N (30) 


0.035 
0.031* 

0.054 
0.099 
0.141 
0.111 


0.033 
0.029* 

0.054 
0.066 
0.095 
0.085 


0.032 
0.030* 

0.050 
0.052 
0.075 
0.094 


0.036 
0.032* 

0.052 
0.047 
0.055 
0.090 


0.031 
0.027* 

0.048 
0.046 
0.051 
0.074 


X (D - D '\ D' = 1 


VMSE d M s 
VMSE d R 
VMSE d w 
VMSE 3^ 
VMSE d N (W) 
VMSE d N (30) 


0.029 
0.032 
0.110 
0.018* 

0.035 
0.039 


0.037* 

0.041 
0.115 
0.064 
0.093 
0.088 


0.035* 

0.037 
0.115 
0.092 
0.102 
0.084 


0.041* 
0.041* 

0.112 
0.084 
0.106 
0.074 


0.038* 

0.039 
0.114 
0.081 
0.094 
0.077 



Table 2: Comparison of the different log-memory parameter estimators for processes of the benchmark. For 
each process and value of d and N, \fMSE are computed from 100 independent generated samples. 
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N = 10 3 



A/f r 

lVloclcl~pln.il ovstion 




d — —0 4 


— q 2 


d — 


d — 2 


d — 4 


FARIMA(0, d, 0) Uniform 


VMSfi d MS 


0.189 


0.090 


0.091 


0.082* 


0.092 




V Ai o hj an 


0.171 


0. 104 


0.109 


0. 102 






V M SE dw 


0.111* 


0.066* 


0.072* 


0.118 


0.129 




VMSE d)^ } 


0.186 


0.081 


0.083 


0.112 


0.093 


FARIMA(0, d, 0) Burr (a = 2) 


VMSE d M s 


0.174 


0.087 


0.092 


0.084 


0.091* 




V M o bj d R 


0.183 


0. 104 


0.097 


0.107 


0.079 




VMSE d w 


0.149* 


0.086* 


0.130 


0.101 


0.129 




VMSE d^ K) 


0.221 


0.119 


0.076* 


0.082* 


0.139 


FARIMA(0, d, 0) Burr (a = 3/2) 


VMSE d MS 


0.188 


0.087* 


0.063* 


0.099* 


0.075 




VMSE dn 


0.183* 


0.110 


0.079 


0.125 


0.072* 




VMSE dw 


0.219 


0.108 


0.138 


0.146 


0.159 




VMSE d^ H) 


0.264 


0.134 


0.094 


0.155 


0.187 


GARMA(0, d, 0) 


VMSE dMS 


0.149 


0.109 


0.086 


0.130 


0.172 




y M SE dn 


U.Uyo 


0. 104 


0.090 


0. 132 


0.125* 




VMSE d w 


0.117 


0.074* 


0.081* 


0.182 


0.314 




VMSE d% R) 


0.124 


0.121 


0.110 


0.102* 


0.331 


Trend 


VMSE d M s 


1.307 


0.891 


0.538 


0.290 


0.150 




VMSE d R 


0.900 


0.700 


0.498 


0.275 


0.087 




VMSE d w 


0.222* 


0.103* 


0.083 


0.071 


0.059* 




VMSE 


1.65 


0.223 


0.079* 


0.050* 


0.076 


Trend + Seasonality 


VMSE d M s 


1.178 


0.803 


0.477 


0.238 


0.123 




VMSE d R 


0.900 


0.700 


0.498 


0.284 


0.091* 




VMSE d w 


0.628* 


0.407* 


0.318 


0.274 


0.283 




VMSE d^ R) 


1.54 


1.01 


0.311* 


0.158* 


0.145 



N = 10 4 



Modcl+Innovation 


VMSE 


d - -0.4 


d - -0.2 


d = 


d = 0.2 


d = 0.4 


FARIMA(0, d, 0) Uniform 


VMSE d M s 


0.177 


0.039 


0.033 


0.034 


0.034 




VMSE d R 


0.171 


0.032 


0.030 


0.028 


0.032* 




VMSE d w 


0.125* 


0.027* 


0.025 


0.028 


0.035 




VMSE (?^ H) 


0.165 


0.042 


0.017* 


0.027* 


0.032* 


FARIMA(0, d, 0) Burr (a - 2) 


VMSE d MS 


0.180 


0.036 


0.041 


0.033 


0.032 




VMSE d a 


0.169 


0.031* 


0.030 


0.031* 


0.029* 




VMSE d w 


0.138* 


0.068 


0.065 


0.076 


0.066 




VMSE 


0.219 


0.067 


0.018* 


0.039 


0.074 


FARIMA(0, d, 0) Burr (a = 3/2) 


VMSE djv/s 


0.18 


0.038 


0.026* 


0.030 


0.021* 




VMSE d R 


0.174 


0.033* 


0.031 


0.023* 


0.023 




VMSE d w 


0.126* 


0.058 


0.149 


0.124 


0.090 




VMSE I-^ R) 


0.264 


0.113 


0.030 


0.099 


0.159 


GARMA(0. d, 0) 


VMSE djv/s 


0.063 


0.041 


0.028 


0.032 


0.060 




VMSE d R 


0.037* 


0.033* 


0.025 


0.026* 


0.030* 




VMSE d w 


0.061 


0.052 


0.021 


0.078 


0.081 




VMSE d^ K) 


0.074 


0.040 


0.016* 


0.055 


0.109 


Trend 


VMSE d MS 


1.16 


0.785 


0.450 


0.171 


0.072 




VMSE d R 


0.900 


0.700 


0.431 


0.192 


0.067 




VMSE dw 


0.135 


0.046 


0.021* 


0.019 


0.021 




VMSE t?^ H) 


0.019* 


0.021* 


0.021* 


0.016* 


0.020* 


Trend + Seasonality 


VMSE d M s 


1.219 


0.841 


0.474 


0.194 


0.099 




VMSE d R 


0.900 


0.700 


0.431 


0.189 


0.063 




VMSE d w 


0.097* 


0.073* 


0.063 


0.065 


0.051 




VMSE d^ H) 


0.671 


0.382 


0.049* 


0.047* 


0.041* 



Table 3: Comparison of the different log-memory parameter estimators for processes of the benchmark. For 
each process and value of d and N , \J MSE are computed from 100 independent generated samples. 
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